Goto

Collaborating Authors

 cell state


RecurrentQuantumNeuralNetworks

Neural Information Processing Systems

With applied quantum computing in its infancy, there already exist quantum machine learning models such as variational quantum eigensolvers which have been used e.g. in the context of energy minimization tasks.


Attention-based Neural Cellular Automata

Neural Information Processing Systems

Recent extensions of Cellular Automata (CA) have incorporated key ideas from modern deep learning, dramatically extending their capabilities and catalyzing a new family of Neural Cellular Automata (NCA) techniques. Inspired by Transformer-based architectures, our work presents a new class of NCAs formed using a spatially localized--yet globally organized--self-attention scheme. We introduce an instance of this class named .


Fast-Slow Recurrent Neural Networks

Asier Mujika, Florian Meier, Angelika Steger

Neural Information Processing Systems

The FS-RNN incorporates the strengths of both multiscale RNNs and deep transition RNNs as it processes sequential data on different timescales and learns complex transition functions from one time step to the next.


SOH-KLSTM: A Hybrid Kolmogorov-Arnold Network and LSTM Model for Enhanced Lithium-Ion Battery Health Monitoring

Jarraya, Imen, Atitallah, Safa Ben, Alahmeda, Fatimah, Abdelkadera, Mohamed, Drissa, Maha, Abdelhadic, Fatma, Koubaaa, Anis

arXiv.org Artificial Intelligence

Lithium (Li) batteries have emerged as a dominant energy storage solution due to their exceptional energy density, prolonged cycle life, fast charging capability, and adaptability across diverse applications, including electric vehicles, renewable energy systems, and portable electronics [1, 2, 3]. However, their performance inevitably degrades with time driven by repeated charge and discharge cycles, temperature fluctuations, and ageing effects [4, 5]. This degradation not only reduces battery efficiency and reliability but also poses significant safety risks, particularly in high-demand applications where performance consistency is critical [6], [7]. As a result, accurate estimation of the State of Health (SOH) is essential to ensure the longevity and safe operation of Li batteries. SOH is a key indicator of the remaining capacity and functional integrity of a battery relative to its initial state. It encompasses key variables such as voltage, current, temperature, and other factors that influence battery performance.


Rethinking Self-Replication: Detecting Distributed Selfhood in the Outlier Cellular Automaton

Hintze, Arend, Bohm, Clifford

arXiv.org Artificial Intelligence

Spontaneous self-replication in cellular automata has long been considered rare, with most known examples requiring careful design or artificial initialization. In this paper, we present formal, causal evidence that such replication can emerge unassisted -- and that it can do so in a distributed, multi-component form. Building on prior work identifying complex dynamics in the Outlier rule, we introduce a data-driven framework that reconstructs the full causal ancestry of patterns in a deterministic cellular automaton. This allows us to rigorously identify self-replicating structures via explicit causal lineages. Our results show definitively that self-replicators in the Outlier CA are not only spontaneous and robust, but are also often composed of multiple disjoint clusters working in coordination, raising questions about some conventional notions of individuality and replication in artificial life systems.


Transforming Credit Risk Analysis: A Time-Series-Driven ResE-BiLSTM Framework for Post-Loan Default Detection

Yang, Yue, Lin, Yuxiang, Zhang, Ying, Su, Zihan, Goh, Chang Chuan, Fang, Tangtangfang, Bellotti, Anthony Graham, Lee, Boon Giin

arXiv.org Artificial Intelligence

Prediction of post-loan default is an important task in credit risk management, and can be addressed by detection of financial anomalies using machine learning. This study introduces a ResE-BiLSTM model, using a sliding window technique, and is evaluated on 44 independent cohorts from the extensive Freddie Mac US mortgage dataset, to improve prediction performance. The ResE-BiLSTM is compared with five baseline models: Long Short-Term Memory (LSTM), BiLSTM, Gated Recurrent Units (GRU), Convolutional Neural Networks (CNN), and Recurrent Neural Networks (RNN), across multiple metrics, including Accuracy, Precision, Recall, F1, and AUC. An ablation study was conducted to evaluate the contribution of individual components in the ResE-BiLSTM architecture. Additionally, SHAP analysis was employed to interpret the underlying features the model relied upon for its predictions. Experimental results demonstrate that ResE-BiLSTM achieves superior predictive performance compared to baseline models, underscoring its practical value and applicability in real-world scenarios.


Uncovering smooth structures in single-cell data with PCS-guided neighbor embeddings

Ma, Rong, Li, Xi, Hu, Jingyuan, Yu, Bin

arXiv.org Machine Learning

Single-cell sequencing is revolutionizing biology by enabling detailed investigations of cell-state transitions. Many biological processes unfold along continuous trajectories, yet it remains challenging to extract smooth, low-dimensional representations from inherently noisy, high-dimensional single-cell data. Neighbor embedding (NE) algorithms, such as t-SNE and UMAP, are widely used to embed high-dimensional single-cell data into low dimensions. But they often introduce undesirable distortions, resulting in misleading interpretations. Existing evaluation methods for NE algorithms primarily focus on separating discrete cell types rather than capturing continuous cell-state transitions, while dynamic modeling approaches rely on strong assumptions about cellular processes and specialized data. To address these challenges, we build on the Predictability-Computability-Stability (PCS) framework for reliable and reproducible data-driven discoveries. First, we systematically evaluate popular NE algorithms through empirical analysis, simulation, and theory, and reveal their key shortcomings, such as artifacts and instability. We then introduce NESS, a principled and interpretable machine learning approach to improve NE representations by leveraging algorithmic stability and to enable robust inference of smooth biological structures. NESS offers useful concepts, quantitative stability metrics, and efficient computational workflows to uncover developmental trajectories and cell-state transitions in single-cell data. Finally, we apply NESS to six single-cell datasets, spanning pluripotent stem cell differentiation, organoid development, and multiple tissue-specific lineage trajectories. Across these diverse contexts, NESS consistently yields useful biological insights, such as identification of transitional and stable cell states and quantification of transcriptional dynamics during development.


No Foundations without Foundations -- Why semi-mechanistic models are essential for regulatory biology

Kovačević, Luka, Gaudelet, Thomas, Opzoomer, James, Triendl, Hagen, Whittaker, John, Uhler, Caroline, Edwards, Lindsay, Taylor-King, Jake P.

arXiv.org Artificial Intelligence

Despite substantial efforts, deep learning has not yet delivered a transformative impact on elucidating regulatory biology, particularly in the realm of predicting gene expression profiles. Here, we argue that genuine "foundation models" of regulatory biology will remain out of reach unless guided by frameworks that integrate mechanistic insight with principled experimental design. We present one such ground-up, semi-mechanistic framework that unifies perturbation-based experimental designs across both in vitro and in vivo CRISPR screens, accounting for differentiating and non-differentiating cellular systems. By revealing previously unrecognised assumptions in published machine learning methods, our approach clarifies links with popular techniques such as variational autoencoders and structural causal models. In practice, this framework suggests a modified loss function that we demonstrate can improve predictive performance, and further suggests an error analysis that informs batching strategies. Ultimately, since cellular regulation emerges from innumerable interactions amongst largely uncharted molecular components, we contend that systems-level understanding cannot be achieved through structural biology alone. Instead, we argue that real progress will require a first-principles perspective on how experiments capture biological phenomena, how data are generated, and how these processes can be reflected in more faithful modelling architectures.


Machine Learning and Deep Learning Techniques used in Cybersecurity and Digital Forensics: a Review

Fattahi, Jaouhar

arXiv.org Artificial Intelligence

In the paced realms of cybersecurity and digital forensics machine learning (ML) and deep learning (DL) have emerged as game changing technologies that introduce methods to identify stop and analyze cyber risks. This review presents an overview of the ML and DL approaches used in these fields showcasing their advantages drawbacks and possibilities. It covers a range of AI techniques used in spotting intrusions in systems and classifying malware to prevent cybersecurity attacks, detect anomalies and enhance resilience. This study concludes by highlighting areas where further research is needed and suggesting ways to create transparent and scalable ML and DL solutions that are suited to the evolving landscape of cybersecurity and digital forensics.


Improvement and Implementation of a Speech Emotion Recognition Model Based on Dual-Layer LSTM

Yang, Xiaoran, Yu, Shuhan, Xu, Wenxi

arXiv.org Artificial Intelligence

This paper builds upon an existing speech emotion recognition model by adding an additional LSTM layer to improve the accuracy and processing efficiency of emotion recognition from audio data. By capturing the long-term dependencies within audio sequences through a dual-layer LSTM network, the model can recognize and classify complex emotional patterns more accurately. Experiments conducted on the RAVDESS dataset validated this approach, showing that the modified dual layer LSTM model improves accuracy by 2% compared to the single-layer LSTM while significantly reducing recognition latency, thereby enhancing real-time performance. These results indicate that the dual-layer LSTM architecture is highly suitable for handling emotional features with long-term dependencies, providing a viable optimization for speech emotion recognition systems. This research provides a reference for practical applications in fields like intelligent customer service, sentiment analysis and human-computer interaction.